Reinforcement learning for phoneme recognition

Authors

  • Akira Ichikawa
  • Tomoyuki Shimizu
  • Yasuo Horiuchi
Abstract

In a spontaneous spoken dialogue understanding system, real-time response and robustness to the environment are required. To realize these requirements, we adopted a multi-agent system architecture. In this paper, we propose a reinforcement learning method for a phoneme-recognizing agent as a sample agent, and adopt a continuous dynamic programming technique to deal with continuous phoneme recognition. To clarify the fundamental characteristics of the proposed method, we define some simple quasi conditions for the experiments, and confirm favorable results. The system can be expected to achieve high adaptability to the environment (e.g., variation of speakers and tasks) and robustness.

Similar resources

Allophone-based acoustic modeling for Persian phoneme recognition

Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation, which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighboring phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...

Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM

Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...

Reaction Time in Phoneme Recognition: A Comparative Study among Iranian Upper-Intermediate vs. Advanced EFL Learners at Institute Level

The present study aimed to investigate reaction time in phoneme recognition through a comparative study of Iranian Upper-Intermediate and Advanced EFL learners at Institute level. The main question this study tried to answer was whether reaction time in phoneme recognition differs among Iranian learners at Institute level. To answer the question, 5Upper-Intermedi...

Chapter 7. Evolving Connectionist and Fuzzy - Connectionist Systems: Theory and Applications for Adaptive, On-line Intelligent Systems

The paper introduces one paradigm of neuro-fuzzy techniques and an approach to building on-line, adaptive intelligent systems. This approach is called evolving connectionist systems (ECOS). ECOS evolve through incremental, online learning, both supervised and unsupervised. They can accommodate new input data, including new features, new classes, etc. New connections and new neurons are created ...

Phoneme-to-grapheme conversion for out-of-vocabulary words in speech recognition

In this report, we show that Out-Of-Vocabulary items (OOVs), recognized using phoneme recognition, can be reasonably reliably transcribed orthographically using Machine Learning techniques. More specifically, (i) we show baseline performance of a machine learning approach to phoneme-to-grapheme conversion when different levels of artificial noise are added (simulating phoneme recognizer errors)...

Publication date: 1999